Sparse sampling heuristic search

نویسندگان

  • Maarten Tromp
  • Bastiaan de Groot
  • Nikos Vlassis
چکیده

In the article A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes, Kearns et al show that there’re now theoratical boundary’s to solve an mdp with an infinite state space in a such a way that the running time has no dependency’s on the size of the statespace. To come to this conclusion they introduce an algorithm that uses a sparse look-a-head tree to select a near optimal action. The algorithm is however exponentional in the horizon time (which depends on the discount factor λ and the desired aproximation), in there article they leave as an open question the possibility that there might be an algorithm which given a generative model (a very common kind of simulator) can come up with an near-optimal solution. In this report whe show that such an algorithm can’t use the hoeffding-bound to estimate his chances of being wrong. Another problem for practical implementation is that altough the amount of samples needed may not depend on the size of the state-space it still can get very high, we therefore have combined the original algorithm of Kearns et al. with a heuristic wich points out were to sample next based on the results of the samples already taken. Altough in the worst case we need to take exactly the amount of sample Kearns et al. need, it’s quite easy to think of mdp’s in wich this heurisitc will save you from taking a lot of not needed samples without loosing any certainty on your aproximation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Heuristic Search for Sparse Motion Planning Data Structures

Sampling-based methods provide efficient, flexible solutions for motion planning, even for complex, highdimensional systems. Asymptotically optimal planners ensure convergence to the optimal solution, but produce dense structures. This work shows how to extend sparse methods achieving asymptotic near-optimality using multiple-goal heuristic search during graph constuction. The resulting method ...

متن کامل

Heuristic Search and Receding-Horizon Planning in Complex Spacecraft Orbit Domains

Spacecraft missions to small celestial bodies face sensitive, strongly non-Keplerian dynamics that motivate the employment of automated sampling-based trajectory planning. However, the scarcity of onboard computing resources necessitates careful formulation of heuristics for efficiently searching the reachable sets, which exhibit complex and finely-detailed structure. We examine a global search...

متن کامل

Abstraction-Guided Sampling for Motion Planning

ion-guided Sampling for Motion Planning University of New Hampshire Department of Computer Science Technical Report 12-01 Scott Kiesel, Ethan Burns and Wheeler Ruml May 1, 2012 Abstraction-guided Sampling for Motion Planningion-guided Sampling for Motion Planning Scott Kiesel and Ethan Burns and Wheeler Ruml Abstract Motion planning in continuous space is a fundamental robotics problem that has...

متن کامل

Multi-agent RRT*: Sampling-based Cooperative Pathfinding (Extended Abstract)

Cooperative pathfinding is a problem of finding a set of non-conflicting trajectories for a number of mobile agents. Its applications include planning for teams of mobile robots, such as autonomous aircrafts, cars, or underwater vehicles. The state-of-the-art algorithms for cooperative pathfinding typically rely on some heuristic forward-search pathfinding technique, where A* is often the algor...

متن کامل

Bootstrapping Simulation-Based Algorithms with a Suboptimal Policy

Finding optimal policies for Markov Decision Processes with large state spaces is in general intractable. Nonetheless, simulation-based algorithms inspired by Sparse Sampling (SS) such as Upper Confidence Bound applied in Trees (UCT) and Forward Search Sparse Sampling (FSSS) have been shown to perform reasonably well in both theory and practice, despite the high computational demand. To improve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005